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Abstract 

Background: Genome-wide linkage studies have identified the 9q22 chromosomal region as linked with colorectal 
cancer (CRC) predisposition. A candidate gene in this region is transforming growth factor (3 receptor 1 (TGFBR1). 
Investigation of TGFBR1 has focused on the common genetic variant rs1 1466445, a short exonic deletion of nine 
base pairs which results in truncation of a stretch of nine alanine residues to six alanine residues in the gene 
product. While the six alanine (*6A) allele has been reported to be associated with increased risk of CRC in some 
population based study groups this association remains the subject of robust debate. To date, reports have been 
limited to population-based case-control association studies, or case-control studies of CRC families selecting 
one affected individual per family. No study has yet taken advantage of all the genetic information provided by 
multiplex CRC families. 

Methods: We have tested for an association between rs1 1466445 and risk of CRC using several family-based statistical 
tests in a new study group comprising members of non-syndromic high risk CRC families sourced from three familial 
cancer centres, two in Australia and one in Spain. 

Results: We report a finding of a nominally significant result using the pedigree-based association test approach (PBAT; 
p = 0.028), while other family-based tests were non-significant, but with a p-value < 0.1 0 in each instance. These other 
tests included the Generalised Disequilibrium Test (GDT; p = 0.085), parent of origin GDT Generalised Disequilibrium 
Test (GDT-PO; p = 0.081) and empirical Family-Based Association Test (FBAT; p = 0.096, additive model). Related-person 
case-control testing using the "More Powerful" Quasi-Likelihood Score Test did not provide any evidence for association 
(M QLS ;p = 0.41). 

Conclusions: After conservatively taking into account considerations for multiple hypothesis testing, we find little 
evidence for an association between the TGFBR1*6A allele and CRC risk in these families. The weak support for 
an increase in risk in CRC predisposed families is in agreement with recent meta-analyses of case-control studies, 
which estimate only a modest increase in sporadic CRC risk among 6* A allele carriers. 
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Background 

Several genome-wide studies [1-3] have provided evidence 
for significant genetic linkage between a chromosomal re- 
gion on 9q22 and an increased risk of colorectal cancer 
(CRC). A further study confirmed this linkage signal and 
fine-mapped the association to a region centred around 
98.15 Mb [4]. Biologically, this chromosomal region houses 
several interesting candidate CRC susceptibility genes in- 
cluding PTCH1, XPA, GALNT12 and TGFBR1 [5]. Follow 
up efforts have particularly focused on TGFBR1 (hgl9 coor- 
dinates, chr9:10L87-101.92 Mb), but with largely inconclu- 
sive results [3,5-7]. 

The transforming growth factor p receptor type 1 
(TGFBR1) gene is an attractive candidate as TGF-p sig- 
nalling plays an important role in the control of a range 
of biological functions associated with colon carcinogen- 
esis including tissue homeostasis, angiogenesis, inflam- 
mation, proliferation and cellular differentiation and has 
and has also been implicated in both the suppression 
and promotion of CRC (see [1] for a recent review). 
On binding of the TGF-p ligand to TGFBR1, this 
serine/threonine protein kinase-containing receptor 
forms a heteromeric complex with type II TGF-p re- 
ceptors thereby transducing the TGF-p signal from the 
cell surface to the cytoplasm. A common variant of 
TGFBR1, rs 11466445 (heterozygote frequency 0.211; 
dbSNP135), contains a deletion of three GCG triplets 
from the sequence of exon 1, resulting in the expres- 
sion of a mutant receptor protein with six consecutive 
alanine (TGFBRP6A) rather than nine consecutive 
alanine (TGFBRP9K) residues. This is a hypomorphic 
mutation encoding a TGFBR1 variant protein with re- 
duced TGF-p growth inhibition-signalling activity. The 
TGFBRP6A allele has been proposed to act as a low- 
penetrance susceptibility allele for a number of malig- 
nancies [8], perhaps acting by decreasing TGFBR1 
allelic expression. Allele specific expression (ASE) of 
TGFBR1 in peripheral blood lymphocytes has been ob- 
served, with decreased expression associated with the *6A 
allele and two other SNPs in linkage disequilibrium [9]. 
Another study examined SNPs in the 3' untranslated re- 
gion of TGFBR1 and found that 29 of 138 patients with 
MSI-negative CRC showed ASE, with 14 of the 29 (48%) 
having a *6A/*9A genotype and clear enrichment of ASE in 
familial cases [10]. 

Although some studies have suggested that the 
TGFBRP6A allele confers an elevated risk of colorec- 
tal cancer [5,8,11], most studies have not found such 
an association [12-17]. A recent large meta-analysis of 
rsl 1466445 and colorectal cancer risk assessed nine 
association studies totalling 6,765 CRC patients and 
8,496 unrelated controls and found that heterozygous *6A/ 
*9A carriers showed a significantly increased risk of CRC 
with a pooled odds ratio (OR) of 1.12 (95% CI = 1.02-1.23; 



p = 0.013) compared to homozygous *9A/*9A carriers [18]. 
A further recent meta-analysis, which included 15 sub- 
groups (7,154 case and 8,851 controls), did not find an as- 
sociation with CRC with overall significance (OR = 1.085, 
95% CI = 0.963, 1.222; additive model), but instead found a 
significant association with breast and ovarian cancer. The 
difference from the previous meta-analysis was the exclu- 
sion of one study and the inclusion of two further studies 
[19]. One of the included studies genotyped rsl 1466445 in 
a Spanish cohort somewhat enriched for familial cancer, 
with -15% of cases having an affected first-degree relative 
and found it to be borderline significant with diagnosis of 
CRC (p = 0.0491; 515 cases, 515 controls) [5]. In the con- 
text of familial CRC in particular, two studies have exam- 
ined families with genetic predisposition [15,20]. In both 
studies, a case-control design was used - drawing on 
only one affected member from each family and com- 
paring this group with unrelated controls. In each 
instance, TGFBRP6K was not found to be associated 
with an increased familial colorectal cancer risk. Inter- 
estingly, a further study found evidence that the 
TGFBRP6K allelic frequency is higher amongst famil- 
ial CRC patients with mismatch-repair (MMR) nega- 
tive disease [21]. 

There have been no reports to date that have explored 
the likelihood of an association of TGFBRP6A with 
hereditary CRC using any family-based association test 
(FBAT) [22-24], or family-based case-control test de- 
signed for related individuals [25]. The family of FBATs 
examine associations within family groups and so are ro- 
bust to population stratification, a known confounder of 
case-control studies [24]. It has been suggested this ro- 
bustness comes at some cost. Simulations show that 
classical FBATs are less powerful than case-control tests 
[24,26], as the latter examine between-family associa- 
tions instead of exclusively within-family associations. 
Counter to this argument, the groups of affected rela- 
tives sampled from multiplex families should have more 
power to detect an association due to the higher than 
expected frequency of susceptibility alleles, compared 
with affected individuals having sporadic disease [25]. It 
is also possible to use quasi-likelihood score (QLS) tests, 
an alternative class of tests to FBATs with different the- 
oretical underpinnings. As opposed to within-family 
tests, these are between-family case-control tests that 
can account for the correlation between individuals in 
families [25]. 

We recently completed a new genome-wide linkage study 
[27] using non-syndromic CRC families from three distinct 
regions in Australia and Spain. One of the linkage regions 
of interest identified in that study was located on chromo- 
some 9q, proximal to the previously reported 9q22 linkage 
region, which contains the TGFBR1 locus. We genotyped 
an expanded set of families for rsl 1466445 and used FBATs 
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and the "More Powerful" Quasi-Likelihood Score Test 
(Mqls) to test for association with diagnosis of colorectal 
neoplasia (i.e. either colorectal adenocarcinoma or 
advanced adenoma). We report that after applying 
several family-based association tests we only found a 
nominally significant result using the PBAT rapid algo- 
rithm (p = 0.028), with another three FBAT algorithms 
all non-significant, but each yielding a p-value < 0.10. 
There was no evidence of an association using the 
Mq L s case-control model (p = 0.41). 

Methods 

Ethics statement 

The study was reviewed and approved by the Human 
Research Ethics Committees of the three participating 
centres: Flinders Medical Centre, Adelaide, The Royal 
Melbourne Hospital, Melbourne and Institut Catala 
d'Oncologia, Barcelona, with informed consent obtained 
from all participants. 

Family members 

A total of 414 individuals (172 males and 242 females), 
from 146 CRC families were recruited from clinics in 
Melbourne, Adelaide and Barcelona and informed con- 
sent was obtained from all participants. We restricted 
our study to non-syndromic high risk CRC families, de- 
fined as those containing at least one affected person 
who has one or more first-degree affected relatives and 
where the known causal mutations had been excluded. 
In each case, the diagnosis was confirmed by medical and 
pathology reports. FAP and MUTYH were excluded clinic- 
ally and HNPCC or Lynch syndrome was excluded by test- 
ing for microsatellite instability (MSI) (as measured by 
tumour-associated length variation in microsatellites BAT- 
25 and BAT-26) and/or immunohistochemistry indicating 
loss of hMLHI, hMSH2, hMSH6 and hPMS2 encoded pro- 
teins. Affected status was defined as diagnosis with either 
colorectal adenocarcinoma (CA) or one or more advanced 
adenomas (AA), where AA was defined as three or more 
synchronous or metachronous adenomas and/or adenoma 
(s) with villous morphology, and/or with severe dysplasia, 
and/or diameter > 10 mm. Diagnoses were confirmed by 
pathology reports. 

Unaffected individuals were family members who were 
either over 70 years of age with no history of CA or AA 
or were 50 years of age or older and had, within the last 
5 years, recorded a colonoscopy result negative for neo- 
plasia. As the age of onset is fairly late with a mean age 
of onset is 55.4 years (Table 1), the cohort is mostly sib- 
ships with missing parental genotypes. However, there is 
inclusion of some extended pedigrees of up to four gen- 
erations (including non-genotyped founders) containing 
parent-child, avuncular or cousin pairs. We reclassified 
11 young "unaffected" people and those with previous 



Table 1 Participant characteristics and demographics 



Participant characteristics 
Number of individuals 

Total including founders 759 

Genotyped subjects 414 

Flinders centre for cancer prevention and control 202 

The Royal Melbourne Hospital 165 

Institut Catala d'Oncologia 33 

Peter MacCallum Cancer Centre 14 
Family structures 

Number of families 147 

Singletons 24 

2 generations (nuclear) 87 

3+ generations (extended) 36 

Trios 25 
Traits of genotyped subjects 

Total Affected 180 

Colorectal adenocarcinoma 145 

Advanced adenoma (s) 35 

Unaffected relatives 223 

Unknown affection 1 1 

Relative Pairs 208 

Sib 169 

Cousin 1 

ParentChild 15 

Avuncular 19 

Other 4 

Female 58.45% 
Age, mean +/- SD 56.4 ± 1 2.6 
Affected Age, mean +/- SD 55.4 ± 1 2.5 
Unaffected Age, mean +/- SD 57.2 ± 1 2.7 



detection of colorectal polyps as "unknown" in accord- 
ance with our previous work [27,28]. Of these 11 people, 
three were heterozygous *6A/*9A genotype, six had the 
common *9A/*9A genotype and two the rare *6A/*6A 
genotype. When affecteds are misclassified as unaf- 
fecteds, family-based tests that make use of discordant 
information lose power [29], so it is sensible to reclassify 
particularly young unaffecteds as having unknown pheno- 
type. All people in the study had their age at blood draw 
recorded. 

Genotyping 

The TGFBR1 rs 11466445 variant status was determined 
by PCR amplification using primers Fwd 5'-GAGGC 
GAGGTTTGCTGGGGTGAGG-3' and Rev 5'-CATGT 
TTGAGAAAGAGCAGGAGCG-3'. PCR amplification 
was performed in a 25 \iL reaction containing 50 ng 
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genomic DNA using the Platinum Taq DNA polymerase 
with the addition of 3 x enhancer solution and followed 
the manufacturers protocol for GC-rich fragments 
(Invitrogen). Amplified fragments were separated by 
electrophoresis on a 10% polyacrylamide gel (Biorad) 
post-stained with gel-red (Jomar Diagnostics). Genotypes 
were assigned according to fragment sizes. A product size 
of 121 bp corresponded to the most common allele, *9A, 
whereas a product size of 112 bp corresponded to the *6A 
allele (Figure 1). 

Statistical analyses 

Testing for deviation from Hardy- Weinberg equilibrium 
and Mendelian inconsistencies was performed using 
Pedstats [30]. The Generalised Disequilibrium Test 
(GDT) V0.1.1 software [24] was used to test for associ- 
ation in dichotomous relative pairs with identity-by- des- 
cent (IBD) statistics estimated using Merlin Vl.1.2 [31] 
Generalised Family-Based Association Tests (FBAT) were 
undertaken in the FBAT V2.0.4 beta software [22] and 
PBAT version 3.6 software [23]. FBAT was set to calculate 
empirical variance estimates and to use a null hypothesis of 
linkage and no association and p-values were generated 
from the asymptotic Normal distribution. For PBAT, we 
used the rapid algorithm, a null hypothesis of linkage and 
no association with sandwich variance estimation and 
p-values were generated using an empirical permutation- 
based method with 10,000 replicates. Sandwich estimation 
was also used to estimate the correlation between members 
of larger pedigrees. For time-to-onset analysis, the 
Wilcoxon Logrank FBAT statistic was examined. 



TGFPR1 Genotypes 



J2 C 




Figure 1 Genotyping example. In an electrophoresis gel, the 

TGFBR1*6A allele migrates as a 1 12 bp species and the TGFBR1*9A 

allele migrates as a 121 bp species. Examples of homozygotes and 

heterozygotes of the two alleles are shown. 
\ J 



For case-control testing the "More Powerful" Quasi- 
Likelihood Score Test (M QL s) was used [25]. The M QLS , 
an improvement on the quasi-likelihood score test W QLS 
[32], is a case-control test for allelic association that condi- 
tions on the pedigree structure using unconditional cor- 
rected variance to account for the relatedness amongst 
individuals. The M QL s can incorporate unaffected controls 
and controls of unknown affection state. It also makes use 
of the affection state of relatives with missing genotype data 
by using their affection status to weight the family. The 
rationale being that an affected person who has additional 
affected relatives is more likely to be carrying a genetic 
predisposition. 

Accounting for linkage 

The use of null hypotheses of "linkage and no association" 
in the FBAT and PBAT software was conservative. While 
the families show genetic linkage with cancer diagnosis in a 
region of chromosome 9 (9q33.3-9q34.3; non-parametric 
LOD = 2.24) with a 1-LOD support interval of -127.97- 
140.0 Mb [27], this does not cover the location of the 
TGFBR1 locus at 101.9 Mb and this region is only weakly 
linked with CRC. At the SNP rs928180, which is in the 
TGFBR1 intragenic region, the non-parametric (S a u) LOD 
score is 0.293. By using IBD information it is possible to 
control for linkage using the GDT. Unlike the FBATs, the 
Mqls case-control test does not control for linkage and al- 
lows both linkage and association to contribute to the test 
statistic. 



Results 

Genotyping and quality control 

We found 315, 95 and four people to be homozygous 
for the rs 11466445 *9A allele, heterozygous and homo- 
zygous for the *6A allele, respectively, with a *6A allele 
frequency (AF) of 0.124. The four people carrying the 
*6A/*6A genotype were dispersed across two families, 
each having one discordant pair (one affected and one 
unaffected individual). An exact test found the genotype 
to be in Hardy- Weinberg equilibrium (all individuals, 
p = 0.3687; 126 unrelated individuals, p = 1.0) and there 
were no observed Mendelian inconsistencies. The allele 
frequencies of the *6A allele in the affected and un- 
affected family members (affected family member, AF = 
0.117, unaffected family member, AF = 0.130) were 
slightly higher than observed in a case-control British 
study of hereditary CRC (913 cases, AF = 0.096; 828 
controls, AF = 0.100) [15] and a further Swedish Cauca- 
sian cohort with hereditary non-polyposis colorectal 
cancer (HNPCC) and non-HNPCC hereditary CRC pa- 
tients (83 HNPCC + 179 non-HNPCC cases, AF = 0.107; 
controls, AF = 0.106) [20]. 
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Family-based association testing 

In the first instance, we tested for an association with 
colorectal neoplasia using the Generalised Disequilib- 
rium Test (GDT). Given the large differences in pedigree 
sizes in this present study and the high number of pos- 
sible intra-pedigree discordant pairings between geno- 
typed people, the generalised relative pairs weighted by 
family size approach implemented by the GDT software, 
provides a good fit with the data. One caveat of using 
discordant pairs, however, is that in complex disease 
some people inheriting a risk allele do not develop 
the disease, or develop it rather late in life and this needs 
to be taken into account. As some of the pedigrees are 
multi-generational, we used inheritance by descent 
(IBD) data to inform the GDT analysis. Testing for asso- 
ciation between the rsl 1466445 *6A allele and colorectal 
neoplasia in 208 discordant relative pairs by the GDT al- 
gorithm produces a p-value of 0.085 (Table 2). Inclusion 
of gender as a covariate did not change the p-value. 
While this result is not significant at a 5% level, given 
the borderline p-value and to avoid false negative results, 
we further tested the association using other family- 
based association methods that construct a test with dif- 
ferent assumptions and/or make use of different group- 
ings of related people within the data. For this, we ran 
a parent of origin GDT test and also the tests imple- 
mented in the FBAT and PBAT software. 

The GDT software allows analysis to be constrained to 
only examine discordant parent-child pairs (GDT-PO) 
and ignore unaffected sibling data. This parent of origin 
test for the 15 parent-child pairs in the study was con- 
sistent with the full GDT result (p = 0.081; Table 2). 
Next, we tested the association using the Family-based 
association test (FBAT), a statistic that examines the co- 
variance between phenotype and allele transmission 
(Mendelian residuals) from parents to offspring. Consid- 
ering there are only four homogyzous *6A carriers we 



did not test the recessive genetic model. As the variant 
falls in an area of weak genetic linkage, FBAT empirical 
variance estimates were used to control for correlation 
amongst sibling genotypes within pedigrees. The FBAT 
result was non-significant (Table 2). Given the large 
number of missing parents in the current study and only 
having 22 (additive model) or 23 (dominant model) in- 
formative nuclear families, there is some reliance upon 
the sufficient statistic and large sample theory. Regard- 
less, the p-value for the *6A allele under an additive 
model (p = 0.096) is close to that obtained with the GDT 
(p = 0.081), which uses a robust measure not dependent 
upon large sample theory. 

Finally, we tested for an association under an additive 
model with the *6A allele using the FBAT implemented 
in the PBAT software. Using the PBAT rapid algorithm, 
the association was found to be nominally significant 
under an additive model (p = 0.0278; 10,000 permuta- 
tions) with a null hypothesis of linkage and no associ- 
ation, with robust sandwich variance estimates (Table 2). 
We also tested for an association between the *6A allele 
and age of CRC diagnosis, but found no evidence (addi- 
tive model, FBAT-Wilcoxon, p = 0.150, null hypothesis - 
linkage, no association with sandwich variance). 

Case-control testing 

We used the "More Powerful" Quasi-Likelihood Score 
Test (Mq LS ) which accounts for relatedness between 
subjects using a corrected variance. Unlike the FBATs, 
the Mq LS can make use of the genotyping information 
of the 24 singletons in the study and can use the people 
with unknown affection status as controls. The result 
was insignificant (Table 2), with a p-value of 0.41 (180 
cases, 137 controls) and specifying a disease prevalence 
of 0.05. The result was highly insensitive to specifying 
other disease prevalence values and setting prevalence to 
0.001, 0.1 and 0.2 gave p-values of 0.39, 0.41 and 0.44, 



Table 2 Association results 



Model 


Allele 


Allele Freq 


Informative units 


P-value* 


Statistic 


GDT 






Relative Pairs 






Allelic 


6 


0.124 


208 


0.085 


1.724 


Parent of origin GDT 






Parent-child pairs 






Allelic 


6 


0.124 


15 


0.081 


1.746 


FBAT (empirical) 






Nuclear families 






Additive 


6 


0.115 


22 


0.096 


1.665 


Dominant 


6 


0.115 


23 


0.193 


1.301 


PBAT (rapid) 






Trios 






Additive 


6 


0.114 


26 


0.028 




MQLS 






Cases/Controls 






Allelic 


6 


0.124 


180/137 


0.41 


0.82 



*- All p-values rounded to 3 decimal places; nominally significant p-value is in bold. 
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respectively. Unlike FBATs, the M QLS is not robust to 
population heterogeneity and will inflate type I error 
rates (the incorrect rejection of a true null hypothesis) in 
instances of population stratification. Given the convin- 
cingly non-significant result we did not investigate this 
further. 

Discussion 

Testing with the rapid PBAT algorithm gave a nominally 
significant result under an additive genetic model (p = 
0.028). Under the GDT, GDT-PO and FBAT approaches 
we did not find a significant association, but all the p- 
values were consistently borderline, with p-values < 0.10. 
Unlike the FBAT approaches, we found no evidence of 
an association using M QLS , a case-control method that 
corrects for relatedness amongst subjects (p = 0.41). 

The differences in p-values between the methods, 
under the same hypotheses and genetic models are due 
to the formulation of the test and also the treatment of 
family structures, which leads to differences in groupings 
informative for the test statistic. 

The GDT, a robust generalisation of the intuitively 
simple transmission-disequilibrium test (TDT), exam- 
ines transmission disequilibrium between pairs of dis- 
cordant relatives. The variant GDT-PO test, considers 
only parent-child discordant pairs. As relatively few par- 
ent-child pairs were genotyped in this study, the test 
will have much reduced power. However, given the age 
of the parents, the result should be more robust to misspe- 
cification of phenotype. The FBAT and PBAT algorithms 
are highly related and examine transmission disequilibrium 
from parents to affected offspring. 

For the FBAT statistic, informative families are those 
with at least one parent heterozygous for the two 
TGFBR1 alleles and having affected offspring. The use of 
only affected offspring in the association statistic makes 
it robust to phenotype misspecification of affected 
people as unaffected. In the case of a missing parent, or 
parents, the test conditions on the sufficient statistic for 
the genotype distribution in each family; where a parent 
genotype is expressed as a set of likelihoods conditioned 
upon known offspring genotype (s). The design of the 
FBAT necessitates that extended pedigrees are split into 
nuclear families, which can introduce bias due to correl- 
ation. The FBAT also requires specification of the gen- 
etic model. The PBAT rapid algorithm differs from 
FBAT in that extended pedigrees are broken up into 
clusters of trios who share the same parents. The rapid 
algorithm in PBAT tests only the minor alleles and offers 
the ability to generate p-values using a robust Monte 
Carlo permutation based method instead of the asymp- 
totic Normal distribution [23]. It also offers time to on- 
set analyses with the same empirical p-values. Finally, 
the Mq LS test is very different to the others, and is a 



regression model rather than a family-based association 
test. It considers both within- and between-family asso- 
ciations using a linear regression of genotypes on affec- 
tion status with correlations for relatedness modelled as 
a kinship coefficient random effect. 

The closeness in p-values between all the family-based 
methods demonstrates the finding is not particularly 
sensitive to different assumptions underpinning these 
various algorithms. More broadly, given the difference in 
informative pairs/families in each FBAT method and na- 
ture of the algorithm, the general agreement across 
methods suggests this marginal evidence of an associ- 
ation is not a spurious result. 

However, the nominally significant PBAT result should 
perhaps be treated with some caution. While the p-value 
was generated using a robust empirical Monte-Carlo 
based method, it is possible the partitioning of people 
into clusters of trios may inflate type I error. As the 22 
informative nuclear families are broken into 26 inform- 
ative clusters of trios, there is a degree of correlation be- 
tween some clusters that is unaccounted for by the 
approach. Conversely, there is reason to think that such 
correlation may not greatly influence the p-value. The 
similarity in p-values between the GDT and the FBAT, 
which also splits extended pedigrees, suggests the differ- 
ence in handling of extended pedigree structure between 
the methods did not overly affect the association test re- 
sult in this instance. 

In essence, a method (PBAT) which examines transmis- 
sion to affected relatives but breaks pedigrees up for com- 
putational reasons is significant, while a method (GDT) 
that examines discordant relative pairs that does not adjust 
pedigree structure is non-significant. It is unclear how the 
different pairings or pedigree structure between these 
methods is contributing to the difference in p-value. In sim- 
ulations, the GDT test was found to have more power over 
the FBAT in the majority of nuclear family and extended 
pedigree structures tested [24]. However, nuclear families 
with two missing parents (which include most of this 
present cohort) were not simulated, so it is possible the 
FBAT implemented in the PBAT software is more powerful 
in this scenario and may help explain the lower p-value. 

Only one SNP was tested for association with CRC, 
however, the genotype data was reformulated into sev- 
eral tests and genetic models with different treatment of 
the genotype data and family structures. Correction for 
these multiple tests can be applied, but such correction 
assumes independence of the tests. As these tests are 
very dependent, such a correction is highly conservative. 
Given the number of tests made of this single hypoth- 
esis, the PBAT association will become non-significant 
after correction for multiple tests. 

All reported association studies of rs 11466445 have 
been of a case-control design [18]. Even studies that 
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have gathered affected cases from families with an inher- 
ited predisposition have used a case-control design, with 
one case selected from each family and compared to un- 
related controls [15,20]. To our knowledge, this present 
study is the first to examine the association between 
rsl 1466445 and colorectal cancer using family-based as- 
sociation statistics or case-control methods for related 
individuals. Our study is designed to examine the associ- 
ation of rsl 1466445 with colorectal neoplasia diagnosis 
within families predisposed to CRC. The within-family 
approach frees the analysis from concerns about popula- 
tion stratification. 

Collectively, current evidence suggests the *6A allele is 
a relatively minor contributor to CRC prevalence. A 
modest 12% and 8.5% increase in CRC risk, respectively, 
was found by two meta-analyses across large populations 
[18,19]. The first meta-analysis found the TGFBR1 *6A 
allele to be significantly associated with CRC while the 
latter did not. The authors of a recent review consider- 
ing the *6A allele association and ASE studies together 
concluded that the effect of the allele on CRC predispos- 
ition is, at best, very subtle [33]. 

Conclusions 

Our finding here of little evidence for an association 
with 6* A in CRC predisposed families supports the con- 
clusions of recent meta-analyses and a review, which 
find the effect of the 6* A allele on CRC risk is modest. 
This weak evidence for association, together with the 
modest linkage signal in the region of the TGFBR1 locus, 
suggests that rsl 1466445 does not contribute significantly 
to the collective genetic predisposition towards CRC in 
these families. 
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